
Update Dashboard addon to version 1.8.0 and align /ui redirect with it #53046

Merged
1 commit merged on Dec 1, 2017

Conversation

@maciaszczykm (Member) commented Sep 26, 2017

What this PR does / why we need it: In Dashboard 1.8.0 we have introduced a number of changes (security, settings, new resources, etc.) and fixed a lot of bugs. You can check the release notes at https://github.com/kubernetes/dashboard/releases/tag/v1.8.0.

Which issue this PR fixes (optional, in fixes #<issue number>(, fixes #<issue_number>, ...) format, will close that issue when PR gets merged): fixes #

Special notes for your reviewer:

Release note:

Updated Dashboard add-on to version 1.8.0: The Dashboard add-on now deploys with https enabled. The Dashboard can be accessed via kubectl proxy at http://localhost:8001/api/v1/namespaces/kube-system/services/https:kubernetes-dashboard:/proxy/. The /ui redirect is deprecated and will be removed in 1.10.

@k8s-ci-robot k8s-ci-robot added size/M Denotes a PR that changes 30-99 lines, ignoring generated files. cncf-cla: yes Indicates the PR's author has signed the CNCF CLA. labels Sep 26, 2017
@k8s-github-robot k8s-github-robot added the release-note Denotes a PR that will be considered when it comes time to generate release notes. label Sep 26, 2017
@maciaszczykm (Member, Author):

/assign @lavalamp

@k8s-ci-robot k8s-ci-robot added size/L Denotes a PR that changes 100-499 lines, ignoring generated files. and removed size/M Denotes a PR that changes 30-99 lines, ignoring generated files. labels Sep 27, 2017
@maciaszczykm changed the title from "Update Dashboard addon to version 1.7.0" to "Update Dashboard addon to version 1.7.1" on Oct 3, 2017
@roberthbailey (Contributor):

@bryk - can you take a look?

@floreks (Member) commented Oct 10, 2017

We are pretty sure that failed tests are related to issue #53382.

@k8s-github-robot k8s-github-robot added the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Oct 16, 2017
@k8s-github-robot k8s-github-robot removed the needs-rebase Indicates a PR cannot be merged because it has merge conflicts with HEAD. label Oct 16, 2017
@maciaszczykm (Member, Author):

@roberthbailey Together with @floreks we have managed to fix the failing tests. Can you take a look?

@maciaszczykm (Member, Author):

The issue mentioned earlier was fixed by a change in our init container.

@@ -31,12 +36,26 @@ spec:
memory: 100Mi
ports:
- containerPort: 9090
Contributor:
Why didn't this port change if everything is shifting to 8443?

Member:

It should be changed. This option should expose port 9090 of this container, right? Is this overridden by the EXPOSE option in the Dockerfile? This container will actually only expose port 8443.

Member, Author:

Missed it before. Fixed now.

@roberthbailey (Contributor):

Please squash your commits.

/assign @mikedanese

to look at the RBAC changes.

rules:
- apiGroups: [""]
  resources: ["secrets"]
  verbs: ["create", "watch"]
Member:

What is the watch used for? cc @kubernetes/sig-auth-pr-reviews

Member:

Not something I'd recommend allowing (this is equivalent exposure to listing all secrets in the namespace). If the Dashboard were in its own namespace this would still not be ideal, but it could be more palatable; in kube-system it is not a reasonable default policy.

@floreks (Member) commented Oct 17, 2017

I'd love to restrict it even further, but there is no option to define a rule that watches changes on a single resource. Dashboard actually only watches a single Dashboard-exclusive resource (a secret named kubernetes-dashboard-key-holder): https://github.com/kubernetes/dashboard/blob/master/src/app/backend/sync/secret.go#L169

Since we are not exposing any endpoint that could allow exploiting that, the only way to somehow abuse this permission would be stealing the token from inside the pod.

PS. It is still a huge step forward for us from the full cluster-admin permissions that were granted previously. We'll update the rules if at some point it becomes possible to restrict a watch rule to a single resource.
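For context, a minimal sketch (assuming a recent client-go, where Watch takes a context) of how a client can scope a watch to one named secret via a field selector; the package, function and variable names here are illustrative, not the Dashboard's actual code. RBAC still has to allow watching all secrets in the namespace, which is exactly what the reviewers object to.

package keysync

import (
	"context"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/apimachinery/pkg/fields"
	"k8s.io/apimachinery/pkg/watch"
	"k8s.io/client-go/kubernetes"
)

// watchKeyHolder opens a watch that is scoped client-side to a single named
// secret. The field selector narrows what the client receives, but the RBAC
// rule quoted above must still grant watch on all secrets in the namespace.
func watchKeyHolder(ctx context.Context, client kubernetes.Interface, namespace, name string) (watch.Interface, error) {
	return client.CoreV1().Secrets(namespace).Watch(ctx, metav1.ListOptions{
		FieldSelector: fields.OneTermEqualSelector("metadata.name", name).String(),
	})
}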

Member:

Until individual watch authz is available, you can do individual gets of the secret, or mount it into the dashboard pod and react to changes in the mounted content.

Member:

Agree with liggitt. Don't use watch, just poll with gets. This is what kubelet does.
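A minimal sketch of the poll-with-gets approach suggested here, assuming a recent client-go (where Get takes a context); the package name, interval, secret name and apply callback are illustrative, not the Dashboard's actual implementation. With this approach the RBAC rule above would only need the create and get verbs instead of watch.

package keysync

import (
	"context"
	"log"
	"time"

	metav1 "k8s.io/apimachinery/pkg/apis/meta/v1"
	"k8s.io/client-go/kubernetes"
)

// pollKeyHolder periodically GETs the single secret the Dashboard cares about
// instead of holding a namespace-wide watch on secrets, and hands the data to
// a caller-supplied callback that refreshes the in-memory key.
func pollKeyHolder(ctx context.Context, client kubernetes.Interface, namespace, name string,
	interval time.Duration, apply func(data map[string][]byte)) {
	ticker := time.NewTicker(interval)
	defer ticker.Stop()
	for {
		secret, err := client.CoreV1().Secrets(namespace).Get(ctx, name, metav1.GetOptions{})
		if err != nil {
			log.Printf("failed to get secret %s/%s: %v", namespace, name, err)
		} else {
			apply(secret.Data)
		}
		select {
		case <-ctx.Done():
			return
		case <-ticker.C:
		}
	}
}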

Member:

We could use multiple decryption keys, but how does this solve the issue of syncing them with all replicas?

Let's consider the use case where Dashboard is behind a load balancer and there is more than one replica. Let's say a user has logged in and the token was encrypted with the locally synchronized key of backend-1. Then the user gets redirected to backend-2 without knowing it, and backend-2 might not have this key synchronized yet if we use a polling mechanism with e.g. a 5-minute period. As a result the user is forcibly logged out.

Second use case: we have one replica and the key is synchronized with a secret. The secret gets deleted manually. In the meantime Dashboard is scaled up to two replicas. The second replica cannot find the secret, so it generates a new encryption key and stores it in a secret. Because of polling we now have two replicas with different keys that will be out of sync for a few minutes.

Currently, when the secret gets deleted it is immediately recreated based on the local copy stored in one of the replicas.

Member:

Then the user gets redirected to backend-2 without knowing it, and backend-2 might not have this key synchronized yet if we use a polling mechanism with e.g. a 5-minute period. As a result the user is forcibly logged out.

  1. The secret contains [key1]; all replicas use key1 for encrypting and decrypting.
  2. Update the secret to contain [key1, key2]. As replicas observe the new secret, they use key1 for encrypting and attempt decrypting with key1 and key2.
  3. Wait at least as long as your secret distribution period, then update the secret to contain [key2, key1]. As replicas observe the new secret, they use key2 for encrypting and attempt decrypting with key2 and key1.
  4. Wait as long as your cookie expiration period (so cookies created using key1 would no longer be valid), then update the secret to contain [key2].

The wait after step 2 is required to let all replicas observe the new decryption key before starting to use it. Alternately, they could react to decryption failures by re-polling the secret to see if there is a new key for them to use.

The wait after step 3 is required to avoid logging out users who logged in and have a session that can only be decrypted by the previous key.
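A rough sketch of the multi-key decryption attempt implied by the steps above; the key layout (an ordered list whose first entry is the active encryption key) and the decrypt callback are assumptions for illustration, not the Dashboard's API.

package keysync

import "errors"

// tryDecrypt attempts decryption with every key currently published in the
// shared secret. keys[0] is assumed to be the active encryption key; older
// keys are kept only so sessions created under a previous key keep working.
func tryDecrypt(token []byte, keys [][]byte, decrypt func(token, key []byte) ([]byte, error)) ([]byte, error) {
	if len(keys) == 0 {
		return nil, errors.New("no decryption keys available")
	}
	var lastErr error
	for _, key := range keys {
		plaintext, err := decrypt(token, key)
		if err == nil {
			return plaintext, nil
		}
		lastErr = err
	}
	return nil, lastErr
}

Because encryption always uses the first key, reordering the secret to [key2, key1] in step 3 switches the active key without invalidating sessions that were created under key1.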

@floreks (Member) commented Oct 19, 2017

Right now we do not have a key rotation mechanism implemented; however, there is a fallback mechanism that forces a synchronous update of the secret in case decryption fails.

Still, with the current implementation I think polling would not work, and it would have to be extended to support storing multiple keys in a secret (even that would need some rework to work properly).

A case in which this would not work with polling:

  1. Start with one replica; it generates and creates a secret with key-1.
  2. The secret gets deleted.
  3. Scale to two replicas. The new replica creates the secret with a new key-2. It does not have any information about the old key-1.
  4. A request goes to the second replica. A token encrypted with key-1 cannot be decrypted with the new key-2. The user is logged out.

If points 2-4 happen during a polling interval and the new replica is not able to synchronize both keys, then there is a problem. Currently this problem is very unlikely to happen because, thanks to the watch, the secret gets immediately recreated from the local copy.

Member, Author:

Hi, how should we proceed to get it merged? Should we implement the behaviour described by @liggitt, move it to another namespace (can Dashboard be a cluster service then?), or do @floreks' concerns sound reasonable and is there another way to go?

@liggitt (Member) commented Oct 31, 2017

A case in which this would not work with polling:

  1. Start with one replica; it generates and creates a secret with key-1.
  2. The secret gets deleted.
  3. Scale to two replicas. The new replica creates the secret with a new key-2. It does not have any information about the old key-1.
  4. A request goes to the second replica. A token encrypted with key-1 cannot be decrypted with the new key-2. The user is logged out.

Yes, deleting state disrupts a rolling update. The same thing would happen with watch (unless you had replica-1 repopulate the secret with potentially old keys, which I wouldn't expect if the secret is supposed to be the authoritative shared state).

Should we implement the behaviour described by @liggitt

Moving to polling seems reasonable for such a slow-moving object, especially given the security trade-off of granting complete access to all kube-system secrets. You could even do a rate-limited re-poll when a decode error is encountered, to stay responsive to key changes on demand.

move it to another namespace (can Dashboard be a cluster service then?)

That would be ideal, but I think the add-on manager only targets the kube-system namespace today.

do @floreks' concerns sound reasonable and is there another way to go?

In order for existing user sessions to continue working and to preserve the ability to scale replicas up and down, you have to keep old decryption keys available in shared state (in the secret) for as long as your user sessions last.
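A small sketch of the rate-limited re-poll idea mentioned above, assuming golang.org/x/time/rate; the cachedKeys, refreshKeys and tryDecrypt helpers are hypothetical stand-ins for whatever the Dashboard would use to reload and apply keys from the secret.

package keysync

import (
	"errors"
	"time"

	"golang.org/x/time/rate"
)

// repollLimiter allows at most one forced secret refresh per 30 seconds, so a
// burst of requests carrying stale tokens cannot hammer the API server.
var repollLimiter = rate.NewLimiter(rate.Every(30*time.Second), 1)

// decryptToken tries the locally cached keys first; on failure it refreshes
// the keys from the secret (rate-limited) and retries once, staying responsive
// to key changes without holding a namespace-wide watch.
func decryptToken(token []byte, cachedKeys func() [][]byte, refreshKeys func() error,
	tryDecrypt func(token []byte, keys [][]byte) ([]byte, error)) ([]byte, error) {

	if plaintext, err := tryDecrypt(token, cachedKeys()); err == nil {
		return plaintext, nil
	}
	if !repollLimiter.Allow() {
		return nil, errors.New("decryption failed and key re-poll is rate limited")
	}
	if err := refreshKeys(); err != nil {
		return nil, err
	}
	return tryDecrypt(token, cachedKeys())
}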

@liggitt (Member) commented Nov 28, 2017

/retest

@liggitt (Member) commented Nov 29, 2017

/lgtm

@k8s-ci-robot k8s-ci-robot added the lgtm "Looks good to me", indicates that a PR is ready to be merged. label Nov 29, 2017
@roberthbailey (Contributor):

/approve no-issue

@enisoc (Member) commented Nov 29, 2017

This has been approved for an extension until the end of Dec 1.

@enisoc enisoc added priority/critical-urgent Highest priority. Must be actively worked on as someone's top priority right now. status/in-progress and removed priority/important-soon Must be staffed and worked on either currently, or very soon, ideally in time for the next release. labels Nov 29, 2017
@enisoc enisoc added this to the v1.9 milestone Nov 29, 2017
@k8s-github-robot:

[MILESTONENOTIFIER] Milestone Pull Request Current

@bryk @lavalamp @liggitt @maciaszczykm @mikedanese @roberthbailey @zmerlynn

Note: This pull request is marked as priority/critical-urgent, and must be updated every 1 day during code freeze.

Example update:

ACK.  In progress
ETA: DD/MM/YYYY
Risks: Complicated fix required
Pull Request Labels
  • sig/ui: Pull Request will be escalated to these SIGs if needed.
  • priority/critical-urgent: Never automatically move pull request out of a release milestone; continually escalate to contributor and SIG through all available channels.
  • kind/feature: New functionality.

@bryk (Contributor) commented Nov 30, 2017

@floreks @maciaszczykm Can you work on getting approvals from hack/OWNERS and pkg/routes/OWNERS?

@floreks (Member) commented Nov 30, 2017

/assign @lavalamp

Could you take a look?

@bryk (Contributor) commented Nov 30, 2017

ACK. Needs OWNERS approval
ETA: when it gets LGTM
Risks: none

@maciaszczykm (Member, Author):

@deads2k @sttts @lavalamp Could one of you take a look?

@maciaszczykm (Member, Author):

ACK. Needs OWNERS approval
ETA: when it gets LGTM
Risks: none

@deads2k (Contributor) commented Dec 1, 2017

The apimachinery is no worse than it was before. Thanks for noting it will be removed in a later release.

/approve

@k8s-github-robot k8s-github-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Dec 1, 2017
@k8s-github-robot:

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: deads2k, liggitt, maciaszczykm, roberthbailey

Associated issue requirement bypassed by: roberthbailey

The full list of commands accepted by this bot can be found here.

Needs approval from an approver in each of these OWNERS Files:

You can indicate your approval by writing /approve in a comment
You can cancel your approval by writing /approve cancel in a comment

@k8s-github-robot:

/test all [submit-queue is verifying that this PR is safe to merge]

@k8s-github-robot:

Automatic merge from submit-queue. If you want to cherry-pick this change to another branch, please follow the instructions here.

Labels
  • approved: Indicates a PR has been approved by an approver from all required OWNERS files.
  • cncf-cla: yes: Indicates the PR's author has signed the CNCF CLA.
  • kind/feature: Categorizes issue or PR as related to a new feature.
  • lgtm: "Looks good to me", indicates that a PR is ready to be merged.
  • priority/critical-urgent: Highest priority. Must be actively worked on as someone's top priority right now.
  • release-note: Denotes a PR that will be considered when it comes time to generate release notes.
  • sig/ui: Categorizes an issue or PR as relevant to SIG UI.
  • size/L: Denotes a PR that changes 100-499 lines, ignoring generated files.